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ABSTRACT 


Darwin’s panoramic view of biology encompassed two metaphors: the phylogenetic tree, pointing to relatively linear (and 
divergent) complexity, and the tangled bank, pointing to reticulated (and convergent) complexity. The emergence of 
phylogenetic systematics half a century ago made it possible to investigate linear complexity in biology. Assumption 0, first 
proposed in 1986, is not needed for cases of simple evolutionary patterns, but must be invoked when there are complex 
evolutionary patterns whose hallmark is reticulated relationships. A corollary of Assumption 0, the duplication convention, 
was proposed in 1990, permitting standard phylogenetic systematic ontology to be used in discovering reticulated evolutionary 
histories. In 2004, a new algorithm, phylogenetic analysis for comparing trees (PACT), was developed specifically for use in 
analyses invoking Assumption 0. PACT can help discern complex evolutionary explanations for historical biogeographical. 
coevolutionary, phylogenetic, and tokogenetic processes. 
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The universe is structured by laws, and science is the 
search for methods of data analysis and theories 
providing powerful general explanations, all in terms of 
those general laws. This ontology of simplicity has 
guided the development of western science for nearly 
2500 years, embodied in the principle of parsimony 
(Latin “parcere,” to spare). Aristotle (350 B.C.E.) 
articulated the ontological view of the principle of 
parsimony, the postulate that “nature operates in the 
shortest. way possible” and “the more limited, if 
adequate, is always preferable.” The principle of 
parsimony is also linked with the English philosopher 
and Franciscan monk William of Ockham (ca. 1285- 
1349), who advocated the use of what is known as 
“Ockham’s razor”: “pluralitas non est ponenda sine 
neccesitate” (“plurality should not be posited without 
necessity”) and “non sunt multiplicanda entia praeter 
necessitatem” (“entities should not be multiplied 
unnecessarily”). In this sense, the principle of simplic- 
ity obliges us to favor theories or hypotheses that make 
the fewest unwarranted, or ad hoc, assumptions about 
the data from which they are derived. This epistemo- 
logical use of the principle does not necessarily imply 
that nature itself is parsimonious. Indeed, despite the 
best efforts of philosophers for more than 700 years, no 
link between parsimony and truth has ever been 
established. Nonetheless, most scientists conduct their 
research as if they believe that nature is parsimonious in 
some sense and, as a consequence, they place much 
more credence in simple than in complex theories. 

The cosmologist Stephen Hawking has dubbed the 
21st century the century of complexity, leading a 
parade of physicists who began to think about 
ontological complexity in the latter decade of the 
20th century. Biology should be well poised to take 
advantage of this sea change in our understanding of 
the nature of science and the universe, having 
discovered an ontology of complexity called Darwin- 
ism almost 150 years ago. Few fully realize that 
Darwinism is not a simple theory: 


there are two factors: namely, the nature of the 
organism and the nature of the conditions. The former 
seems to be much more the important; for nearly 
similar variations sometimes arise under, as far as we 
can judge, dissimilar conditions; and, on the other 
hand, dissimilar variations arise under conditions 
which appear to be nearly uniform.” 

—Darwin (1872: 32) 


Most evolutionary biologists consider this passage 
no more than a general repudiation of Lamarckism. 
We believe, however, that it is one of the first modern 
articulations of a complex theory: Darwin proposed 
that evolution is an emergent property of asymmetrical 
interactions between two different causal agents, each 
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with its own properties, producing outcomes that are 
not readily predictable from knowledge of the 
properties of either agent and, thus, historically 
contingent. 

Darwin thought that organisms were historically 
and developmentally cohesive wholes, and it was in 
the “nature of the organism” to produce offspring that 
were all highly similar (but not identical) to each other 
and to their parents and other ancestors. He also 
that 
without regard for environmental conditions, and it 


postulated reproduction produced variation 
was in the “nature of the organism” to produce these 
offspring in numbers far exceeding the resources 


When this 


overproduction produced variety in critical charac- 


available for their support. inherent 
ters, natural selection would preserve the versions that 
were functionally superior in that particular environ- 
mental context (adaptations). Whenever an environ- 
ment changed, those organisms that already had the 
adaptations necessary to survive would do so, whereas 
those lacking appropriate adaptations would not. The 
production of organismal diversity thus required that 
organisms be at once autonomous from, and sensitive 
to, the environment, another example of complexity. 

Darwin visualized his viewpoint about the com- 
plexity of evolution with two metaphors, the phyloge- 
netic tree and the tangled bank. The phylogenetic tree 
points to complexity arising from irreversible phe- 
nomena. By referring to species as “communities of 
descent” and placing them in a single “tree of life,” 
Darwin emphasized that the fundamental explanatory 
principle is shared history. The tangled bank, by 
contrast, points to complexity arising from biological 
associations that do not share the same temporal and 
spatial origins, producing reticulated (tangled) pat- 
terns of descent and association. 

By the end of the 19th century, most biologists had 
adopted the view that evolution had occurred, but 
many were uncomfortable with Darwinism’s lack of 
simple general laws, and the discovery of the “Laws of 
Inheritance” by Mendel did little to quell this unrest. 
Bowler (1983) documented the eclipse of Darwinism 
during the period of 1890-1940. The most influential 
theories during that time were neo-Lamarckism and 
orthogenesis, both theories of simplicity. As a theory 
of environmental determinism, neo-Lamarckism 
placed causality solely in the nature of the conditions. 
that 
environment played little or no role in evolution, 


Orthogeneticists, by contrast, asserted the 
thereby placing causality solely in the nature of the 
organism. The eclipse of Darwinism is assumed to 
have ended with the emergence of neo-Darwinism in 
the 1930s. Neo-Darwinism focused on Darwin’s key 
mechanism, natural selection. Buoyed by advances in 


understanding the mechanisms of genetic inheritance 
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and some elegant experiments, this became the 
dominant theory for evolutionary biologists during 
the second half of the 20th century. Some founders 
fully expected neo-Darwinism to encompass the 
panorama of phenomena encompassed in Darwin's 


original formulation: 


“...in every part of the whole, wonderful history of 
life, all the modes and all the factors of evolution are 
inextricably interwoven. The total process cannot be 
made simple, but it can be analyzed in part. It is not 
understood in all its appalling intricacy., but some 
understanding is in our grasp, and we may trust our 
own powers to obtain more.” 
—Simpson (1953: 393) 


Sadly, this was not to be the case. As noted by 
Gould (1983) using the rubric “the hardening of the 
synthesis,” during the latter third of the 20th century, 
neo-Darwinism maintained a narrow focus on popu- 
acknowledging other aspects of 
that all of 
manifestations of population genetics and natural 


lation genetics, 


evolution but asserting them were 
selection and, therefore, all appearances of complexity 
could be reduced to the sometimes computationally 
complex, yet ontologically simple, principles of 
population genetics. Like the neo-Lamarckians, neo- 
Darwinians focused their attention on the environ- 
mental conditions correlated with the survival of 
organisms, populations, and species. Thus, many 
explanations expounded by neo-Darwinians sound 
Lamarckian because the nature of the organism has 
been reduced to almost nothing. For many today, the 
nature of the organism is to be a blind watchmaker 
made up of selfish genes. Organisms exist, but they 
have no “nature” autonomous from their surroundings. 
This produces a theory that is parsimonious, yet 
lacking Darwin’s panoramic view of biological 
diversity. 

During the past 25 years, a growing number of 
scientists and philosophers have called for modifica- 
tions of evolutionary theory (e.g., Eldredge, 1985, 
1986, 1989; Brooks & Wiley, 1986, 1988; Maynard 
Smith & Szathmary, 1995; Brooks, 1998, 2000, 2001, 
2002; Collier & Hooker, 1999; Brooks & McLennan, 
2000 and references therein). Many of these calls seek 
to reintroduce Darwin’s postulates about “the nature 
of the organism” and the “nature of the conditions” to 
develop analytical methods for integrating the tree of 
life and the tangled bank metaphors, and to establish 
fundamental guiding principles or null hypotheses for 
complex evolutionary phenomena occurring on differ- 
ent spatial and temporal scales. 

The phylogenetics revolution that began in the late 
1960s 1966) 


beginning in the 1980s (for a review, see Brooks & 


and became entrenched 


(Hennig, 


McLennan, 2002) produced quantitative analytical 
methods for effectively documenting the complexity of 
the tree of life. Phylogeneticists discovered quickly 
that most phylogenetic tree complexity is manifested 
in a relatively small number of linearly organized 
historical correlations (which we consider to be the 
communities of descent) and, therefore, is (once again) 
computationally complex but ontologically simple. 
Methods developed for analyzing historical associa- 
tions described by tangled bank complexity assumed 
that such associations could also be adequately 
described and explained using phylogenetic tree— 
based reasoning (e.g., Brooks, 1985). Tangled bank 
complexity, however, produces reticulated historical 
relationships in addition to the relatively simple linear 
ones. We now know that methods of analysis based on 
phylogenetic tree-based reasoning produce internally 
inconsistent results in direct proportion to the degree 
of tangled bank complexity in the data, an indication 
that those methods have oversimplified the data (van 
Veller et al., 1999, 2000, 2001, 2002, 2003; Dowling, 
2002; Dowling et al., 2003). Wojcicki and Brooks 
(2004, 2005) developed an algorithm called phyloge- 
netic analysis for comparing trees (PACT) that permits 
phylogenetic systematic principles to be used in 
describing tangled bank type complexity. 

PACT is based on the assumption that tangled bank 
complexity is based on individual parts that each 
conform to relatively simple tree-like complexity 
which, when combined, may nonetheless produce 
of both 


relationships, including the possibility of reticulated 


complex patterns general and unique 
relationships. PACT uses three guiding principles: (1) 
Assumption 0 (Wiley, 1986, 1988a, b; Zandee & 
Roos, 1987): in order to not oversimplify, you must 
analyze all input data without modification, and your 
final result must be logically consistent with all input 
data; (2) the duplication rule (Brooks & McLennan, 
2002): Assumption O is violated with analyses by 
methods of simplicity when the data result from 
reticulated processes. Oversimplification can be 
avoided (i.e., Assumption O can be satisfied) in such 
cases by duplicating entities with reticulated histo- 
ries; and (3) epistemological parsimony (Brooks & 
McLennan, 2002): in order to find the maximum 
number of generalities supported by the data, entities 
should not be duplicated beyond necessity. Therefore, 
we should make only enough duplications to satisfy 
Assumption 0. Underscoring its connection with the 
ontology of complexity, note that Assumption O0 is 
more fundamental than parsimony. If evolutionary 
patterns were simple, Assumption 0 would never be 
violated, so we would never need to duplicate entities 
and would never need parsimony. What we are 


seeking is the most general explanation possible 
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—I. PACT analysis for simple congruence. Top = two hierarchical patterns based on different data but 


pertaining to entities A, B, C, and D. Second from top = the two hierarchical patterns aligned by common elements. Third from 
lop = both hierarchical patterns superimposed on each other. Bottom = PACT solution based on applying combination rule 1 
(Y + Y = Y). —2. PACT analysis for two incongruent patterns. Top = 
pertaining lo entities A, B, C, D, E, and F. Second from top = the two hierarchical patterns aligned by common elements. Third 
from top = both hierarchical patterns superimposed on each other. Bottom = PACT solution based on applying combination rule 


two hierarchical patterns based on different data but 


Zine = YN) YA) YN (AR) = 
given the data (i.e., the ontology of complexity), not 
the simplest explanation we can imagine (i.e., the 
ontology of simplicity) (see van Veller & Brooks, 
2001; Brooks & McLennan, 2002). What we have to 
guard against is the use of models and analytical 
methods that oversimplify the data in an effort to 
generalize. That is, simplicity is not always the most 
parsimonious depiction of the real world (van Veller & 
Brooks, 2001). 

PACT utilizes any data that can be represented as 
hierarchical strings, ranging from gene trees and other 
character state trees to species phylogenies. Wojcicki 
and Brooks (2004, 2005) recognized that nested 
(historical) hierarchical patterns in nature are combi- 
nations of only six different classes of pattern 
modules. Analysis of a set of data is accomplished 
by selecting any one of the input data strings as the 
initial template. Next, the final analysis is built by 
sequentially aligning each additional data string with 
the template. Thereafter, combinations are made to 


satisfy Assumption O using four combination rules. 


YN (AF): Y (E) + YN (ED) = YN (ED): ¥ 


(B) + YN (C(DE)) = YN (B(C(DE))) 


The combination rules are illustrated in Figures 1—6 
and explained briefly in the figure legends; for more 
details, see Wojcicki and Brooks (2004, 2005). Some 
workers seem to have assumed that PACT is an 
algorithm for the method called Secondary BPA 
(Brooks & McLennan, 2002). A purported critique of 
PACT by (2008) addresses 


shortcomings of secondary BPA overcome by PACT 


Salvador Arias et al. 
that were specifically addressed by Wojeieki and 
Brooks (2004, 2005). 

Elucidating complex evolutionary patterns using 
PACT comes with a cost of sorts. Explaining the 
complexity of a set of observations can only be 
achieved by abandoning a priori expectations of 
mechanism. For example, is the presence of a species 
in area F in Figure 2 an indication of a unique event 
involving the production of a novelty in one of the data 
strings, or is it an indication of a unique event 
involving loss from the other? PACT produces the 
same result regardless of the interpretation, so no 


assertions or assumptions about processes are em- 
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—3. PACT analysis for two incongruent patterns. Top = two hierarchical patterns based on different 


data but pertaining to entities A, B, C, D, and E. Second from top = the two hierarchical patterns aligned by common 
elements. Third from top = both hierarchical patterns superimposed on each other. Bottom = PACT solution based on 


applying combination rule 2 (Y + YN = YN). Y (A) + YN (AD) = YN (AD); Y (E) + YN (ED) = YN (ED); Y (B) + N (C(DE)) = 
YN (B(C(DE))). Note that this example is precisely the same as the one shown in Figure 2, with the exception that in this case 
we find evidence of reticulated relationships. —4. PACT analysis for two congruent patterns. Top = two hierarchical patterns 
based on different data but pertaining to entities A, B, C, D, and E. Second from top = the two hierarchical patterns aligned by 
common elements. Third from top = both hierarchical patterns superimposed on each other. Bottom = PACT solution based 


on applying combination rule 3 (YN + YN = YNN). Y (A)N (BC) + Y (A) N (DE)) = YNN (A(BC)(DE)). (One should not over- 


interpret the data.) 


bedded in the algorithm. Similar reasoning can be 
applied to concepts such as lineage duplication, 
lineage sorting, and ancestral areas/hosts. All these 
concepts pertain to hypotheses of mechanism, and in 
order to maximize empirical robustness and avoid the 
possibility of circularity, all processes should be 
inferred following an analysis of data, not built into it. 

It is for this reason that PACT makes combinations 
beginning with the youngest members of the data 
strings being compared. This avoids the assumptions, 
unintentionally embedded in all previous methods for 
historical biogeography and coevolution, that (1) all 
species associated today originated together at the 
same place and time; and (2) when there is ambiguity, 
we resolve the ambiguity beginning with the oldest 
(most basal) portions, amounting to an unacceptable 
assumption that the older the evolutionary event, the 
more accurate our information about it. 

PACT analysis is, therefore, only the beginning of a 
study, not an end in itself. Explaining the result of a 
PACT analysis requires reference to guiding princi- 
ples that are more comprehensive than those currently 


in use. We illustrate this briefly for four different 
research programs in comparative evolutionary biology. 


RESEARCH PROGRAMS 
HISTORICAL BIOGEOGRAPHY 


Biogeography is the basis for understanding the 
origin of species, the assemblage of communities, and 
the differential radiation of clades (Brooks & 
McLennan, 2002). In the latter third of the 20th 
century, historical biogeography produced two simple 
and elegant theories that were based on an ontology of 
simplicity rather than complexity. 

The first of these was the equilibrium theory of 
island biogeography (ETIB) (MacArthur & Wilson, 
1963, 1967). This theory is based on the view that 
dispersal from “source” areas to “islands” (actual or 
metaphorical), mediated by island size and distance, 
produces linear log-normal species-area relationships. 
Noise in the system or the effects of contingency 
comprise in situ speciation and extinction. From this, 
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Figure 5. 


PACT analysis for lineage duplication (sympatric speciation). Top = two hierarchical patterns based on 


different data but pertaining to entities A, B, C. and D. Second from top = the two hierarchical patterns aligned by common 


elements. Third from top left = both hierarchical patterns superimposed on each other. Third from top right = PACT solution 
based on applying combination rule 1 (Y + Y = Y). Note that this leaves duplicate (CD) elements coming from a common 
node. Bottom left = two (CD) groups coming from a common node superimposed on each other. Bottom right = PACT solution 


based on applying combination rule | (Y + Y = Y). 


one infers that data that conflict with the expected 
pattern (the “law”’) are the result of historical 
contingencies, and it is therefore permissible to 
remove or modify them. As a result, island biogeog- 
raphers are admonished to study small, young islands 
in order to minimize the potential for such historical 
contingencies that cloud our ability to see the true 
(and simple) pattern. 

The second theory of simplicity was the maximum 
(Max Vic), 
vicariance biogeography or cladistic biogeography 
(Humphries & Parenti, 1999), Max Vic is based on the 
theory that in situ speciation and extinction produce 


vicariance hypothesis also known as 


simple area cladograms in which each area occupies a 


unique position. Noise in the system or the effects of 


contingency result from dispersal. From this, one 
infers that data that conflict with a single area 
cladogram in which each area appears once (the 
“law”) are the result of historical contingencies, and it 
is therefore permissible to remove or modify them. As 
a result, cladistic biogeographers developed Assump- 
tions 1 and 2 to remove or modify (“reconcile”) 
incongruent data with a single simple area eladogram 
in which each area occupies a unique position. 

One persistent concern about both these paradigms 
has been this: if it is necessary to remove and modify 
data and to restrict one’s scope of analysis, just how 
general and powerful are the explanations produced? 
A closer comparison of the two paradigms reveals 
another interesting feature: they are complementary 
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PACT analysis of two partly congruent patterns. Top = two hierarchical patterns based on different data but 


pertaining to entities A, B, C, and D. Second from top = the two hierarchical patterns aligned by common elements. Third from 
top = both hierarchical patterns superimposed on each other. Bottom = PACT solution based on applying combination rule 


4 (Y(Y- +Y = (Y(Y-. Y (A) (Y 


theories, each one excluding the others domain of 
explanation. What is missing from ETIB are assess- 
ments of the geographic origin of species, even though 
this is what distinguishes a “source” from an “island.” 
What is missing in MaxVic are assessments of post- 
speciation movements, and yet this is how ancestral 
species become widespread enough to be affected by 
vicariance. If each of these theories describes 
something valid and each one excludes the others 
explanatory domain, perhaps the problem lies in the 
adoption of ontological parsimony on the part of 


advocates of both theories of biogeography. 


At 
nearly the same time the maximum vicariance 
paradigm emerged, Erwin (1979, 1981) proposed the 
taxon pulse hypothesis as a model incorporating both 


A “new” guiding principle: The taxon pulse. 


dispersal and vicariance. Erwin’s model stemmed 
from an idea proposed by Darlington (1943), later 
named the “taxon cycle” by Wilson (1959, 1961). 


(A— + Y (A-) = Y (A) (Y (A-. This exemplar is also known as the “problem of paralogy.” 


Taxon pulse and taxon cycle models both assume that 
species and their adaptations arise in “centers of 
diversification,” and that distributional ranges of taxa 
stable, 
continuously occupied center. This general biotic 


periodically fluctuate around a more 
dispersal may be interrupted by the formation of 
barriers, producing episodes of vicariant speciation. 
Breakdown of those barriers produces new episodes of 
biotic expansion, setting the stage for yet more 
episodes of vicariance. Taxon cycles occur over 
relatively short periods of time (“ecological time”) 
and involve species that disperse actively and 
colonize new areas during expansion episodes, then 
contract their ranges during periods of habitat 
contraction without producing new species. Taxon 
pulses, by contrast, occur over relatively long periods 
of time (“evolutionary time”) and are characterized by 
dispersal along a broad front during expansion into 
suitable habitat when previous barriers break down. 


During this expansion phase, different species within 
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a biota encounter additional geographic heterogeneity, 
including range contractions. Such heterogeneity may 
(1) stop the expansion of some species, resulting in 
species of restricted distributions; (2) affect only the 
rate of expansion for some species, producing 
widespread species; or (3) act as barriers to 
dispersal of sufficient magnitude to produce new 
species as a result of peripheral isolates speciation. 
Geological evolution, operating on longer time scales 
than biological evolution, may also produce barriers, 
resulting in episodes of vicariant speciation affecting 
members of these same biotas. 

Despite the existence of an alternative to MaxVic 
and despite concerns that exemplar taxa were being 
show a 
(Simberloff et al., 1980; Simberloff, 
1987), vicariance has become the default explanation 


carefully selected to 


vicariance 


for any observation of allopatry. And yet. Max Vic has 


always been deficient because it neglects the issue of 


how ancestral species of many clades become 
widespread enough to be affected by vicariant events. 
If vicariance affects many members of ancestral biotas 
in the same way, it seems reasonable to assume that at 
some point in the past the members of the biota 
expanded their geographic ranges to such an extent 
that they could be affected by the subsequent 
vicariance event. Advocates of vicariance biogeogra- 
phy have acknowledged that this must happen. Wiley 
(L981) that 


colonization of islands, might produce general distri- 


noted some circumstances, such as 
bution patterns based on dispersal rather than 
vicariance, and Endler (1982) suggested that such 
correlated dispersal patterns might be common. In 
practice, however, historical biogeographers have 
simply assumed that such dispersal does not produce 
general patterns, so it is permissible to invoke 
dispersal only to explain departures from the general 
pattern, which is always explained as the result of 
vicariance (Wiley, 1986, 1988a, b; Brooks & 
McLennan, 199], 2002; Humphries & Parenti, 1999). 

‘Taxon pulse-driven biotic diversification differs from 
vicariance-driven biotic diversification in three impor- 
tant ways. First, because diversification is driven by 
biotic expansion, we expect to find general patterns 
associated with dispersal, not just with vicariance. 
General patterns resulting from biotic expansion occur 
when barriers to dispersal, especially the large-scale 
ones leading to vicariance, break down. Second, 
episodes of biotic expansion, even those involving large 
areas, will inevitably lead to reticulated historical 
relationships among areas and biotas within areas of 
endemism comprising species of different ages derived 
from different sources. Third, the absence of particular 
clades in particular areas is more parsimoniously 
explained as a lack of participation in that particular 


preponderance of 
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expansion episode by a particular clade, rather than 
dispersal with extinction. Taxon pulses are also 
historically contingent, meaning that at any given time, 
different clades comprising a complex biota may form a 
mosaic of area relationships. 

Recent studies have shown extensive reticulated 
area relationships even for data sets carefully chosen 
to emphasize vicariance (e.g., Brooks & McLennan, 
2001; Green et al., 2002; McLennan & Brooks. 2002; 
Spironello & Brooks, 2003; Bouchard et al., 2004; 
Brooks & Ferrao. 2005: Brooks & Folinsbee. 2005: 
Halas et al., 2005; Folinsbee & Brooks. 2007). 


4 new research program for historical biogeographic 
analyses. Following Halas et al. (2005). a new 
research program based on an ontology of complexity 
for historical biogeographical analyses comprises two 
steps: 

Step 1—Producing a general area cladogram. 
Convert all phylogenetic trees into taxon-area clado- 
grams by replacing the names of the species with the 
areas they inhabit, then combine them using PACT. 
producing a general area cladogram (GAC), Figure 7 
is the GAC resulting from PACT analysis of elephants, 
hyenas, and hominoids since the Miocene (Brooks & 
Folinsbee, 2005; Folinsbee & Brooks, 2007). 

Step 2—Inferring biogeographic history from 
the GAC. This is a two-part process. First, we musl 
distinguish general from unique nodes. Unique nodes 
are produced by an evolutionary event affecting only a 
single clade. At present, all nodes associated with 
evolutionary events affecting more than one clade are 
termed general nodes. Clearly, a general node 
involving two of 100 clades likely is not the same as 
one involving 75 of 100 clades. At the moment. 
however, we do not have enough empirical data or any 
models to allow us to be more fine-grained in our 
distinctions among general nodes, but this is clearly 
an area of interest for future research. Second. we 
must interpret the general nodes. Until recently. 
historical biogeographers believed that all general 
nodes in an area cladogram should be ascribed to 
vicariance. Lieberman (2000, 2003a. b). however, 
noted that episodes of geodispersal (geographic areas 
fusing rather than splitting) could produce general 
nodes resulting from dispersal. In a similar vein, the 
taxon pulse hypothesis asserts that episodes of general 
biotic expansion occurring between vicariance (or, 
more generally, isolation) events set the stage for 
future isolation events. Lieberman (2000, 2003a, b) 
proposed a protocol for distinguishing general nodes 
in area cladograms due to isolation from those caused 
by geodispersal or biotic expansion. The protocol is 
based on phylogenetic character optimization (Fitch 
optimization), using areas inhabited as the characters. 
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Figure 7. 


General nodes associated with vicariance or other 
forms of isolation exhibit decreasing numbers of areas 
occupied relative to the next oldest node (Fig. 8), 
whereas general nodes associated with biotic expan- 
sion (breakdown of a barrier) are associated with 
increasing numbers of areas occupied relative to the 
next oldest node (Fig. 9). 

Figure 10 summarizes the nodal analysis for the 
GAC shown in Figure 7. The alternation of isolation 
(V, or vicariance) nodes with biotic expansion (BE) 
and within-area (W) differentiation nodes produces 
the characteristic signature of a taxon pulse radiation. 
The relatively large number of general nodes (22 out 
of 28, 79%) that all 


participated in the same general taxon pulse scenario, 


or suggests three clades 


summarized in Figure l1. 


Complexity studies in biogeography: Integrating 
historical and ecological biogeography. That larger 
islands have greater species richness than smaller 
islands, and that “islands” need not be oceanic 
because species richness increases with any increased 
sample of area, have long been recognized. ETIB 
(MacArthur & Wilson, 1963, 1967) predicts a linear 
log-normal function relationship between species 
richness and the size of an island resulting from a 
that 


colonization from a source area. and extinction. The 


dynamic balance between immigration, is, 


extinction rate is assumed to increase with the number 
of species present on any island, so that small areas 
with higher species richness than expected have a 


AS 
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PACT analysis of elephants, hyenas, and hominoids since the Miocene (Brooks & Folinsbee, 2005; Folinsbee & 
Brooks, 2007). General area cladogram. AF = Africa, AM = Americas, AS = Asia, EU 


= Europe. 

higher extinction rate than immigration rate and are 
not yet in equilibrium. MacArthur and Wilson (1963) 
acknowledged that “local speciation” (in situ specia- 
tion) would confound the species-area relationship, 
but suggested that for most cases it was probably safe 
to omit in situ speciation from the model as its effect 
on the species-area relation is “probably significant 
only in the oldest, largest, and most isolated islands” 
(1963: 380). Recent 


suggested that such historical phenomena require 


discussions, however, have 
closer investigation (e.g., Heaney, 2000; Whittaker, 
2000). 
Halas et al. (2005) performed the first direct 
analysis of the phylogenetic impact on the species- 
the 
presented by Marshall and Liebherr (2000), repre- 
of 


flowering plants occurring throughout Mexico and 


area relationship using extensive data set 


senting 33 clades insects, vertebrates, and 
parts of Central America. Nodal analysis of their GAC 
permitted them to classify each species in each area 
as a colonizer or a resident. Correlating species 
richness and area size for the 33 clades and nine areas 
in Mexico and Central America (for details, see Halas 
et al., 2005) produced a low correlation coefficient for 
the species-area curve (r° = 0.47), due primarily to 
two relatively small areas (the Transmexican Volcanic 
Belt and the Sierra Madre del Sur) containing 
unusually large numbers of species. Correlating 
extinction events and species richness for this data 
set produced a high correlation coefficient (r? = 0.75), 
indicating strong support for this prediction of the 
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Figures 8 and 9. 


cD 


CDE 


—8. Lieberman’s nodal analysis 1. Thick branches indicate general nodes; thin branches represent 


elements of the GAC produced by only one clade. Progressive decrease in the number of areas associated with nodal values 
indicates that all general nodes are isolation nodes. This corresponds to either maximum vicariance or Hennig’s progression 
rule, —9, Lieberman’s nodal analysis 2. Thick branches indicate general nodes; thin branches represent elements of the GAC 
produced by only one clade. Increase in the number of areas associated with nodal values from node D to node CDE indicates 


an episode of biotic expansion or general dispersal; decrease in the number of areas associated with nodal values from CDE to 


CD indicates an episode of isolation, corresponding to either vicariance or Hennig’s progression rule. Alternation of biotic 


expansion and isolation nodes is characteristic of taxon pulse radiations. 


ETIB. Following Macarthur and Wilson’s lead, Losos 
and Schluter (2000) reported that, for anoline lizards 
on large Caribbean islands, inferred extinction rates 
were low and in situ speciation was a more important 
source of species richness than colonization. They 
predicted that effects similar to those they observed 
on the largest islands should be found on continental 
Halas et al. (2005) corroborated those 
predictions: in situ speciation correlates better with 


islands. 


area size than does colonization, and inferred 
extinction rates are low. The protocol produces 
inferences of only 19 extinction events involving 16 
of the 33 
inferences of extinction can be made only for episodes 
the 


following biotic expansion events, which account for 


clades, because direct parsimonious 


of vicariance. If extinction rates are same 
60% of the general nodes, the number of inferred 
extinctions increases from 19 to 48, but does not 
change the correlations with area size. 

These data provided additional insight into the 
evolutionary relationship between colonization and in 
situ speciation. There is a very poor correlation 
between colonization and in situ speciation (° = 
0.02), indicating that these phenomena are relatively 
independent of each other. We suggest that the reason 
large islands confound the 


historical effects on 


species-area relationship is the subsequent dispersal 


of some species produced in situ to other islands, so 
sources become islands and islands become sources 
on evolutionary Lime scales. All nine areas studied by 
Halas et al. (2005) have served as both sources and 
islands at different times and to different degrees. 
Halas et al. (2005) 


episodes of vicariance and nine of biotic expansion 


identified six fundamental 
as the main determinants of the biogeographic 
patterns observed in the data set. The Transmexican 
Volcanic Belt has acted as an “island,” receiving 
species by colonization, for all nine biotic expansion 
events, and the Sierra Madre del Sur has acted as a 
dispersal source for three of the episodes of biotic 
expansion. At the same time, both areas were involved 
in five of the six episodes of vicariance. This explains 
why these relatively small areas are disproportionately 
species rich, without any evidence of an accompany- 
ing high extinction rate. 

The analysis by Halas et al. (2005) indicates that 
MacArthur and Wilson (1963, 1967) were correct 


situ speciation confounds the species-area relation- 


in 


ship. This occurs when the same areas function as 
islands and as sources at different times. How can this 
happen? The answer is niche diversification associ- 
ated with speciation, producing island species that 
can colonize the source area(s). Niche diversification 
much slower rate than 


most often occurs at a 
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Figure 10. 


PACT analysis of elephants, hyenas, and hominoids since the Miocene (Brooks & Folinsbee, 2005; Folinsbee 


& Brooks, 2007). General node analysis. AF = Africa, AM = Americas, AS = Asia, BE = biotic expansion, EU = Europe, 


V = vicariance, W 
of taxon pulse radiations. 


speciation (Brooks & McLennan, 2002), but the 
biogeographic “role reversal” resulting from niche 
diversification need not occur often in order to affect 
the equilibrium number of species in a given area. In 
addition, Cracraft (1985) seems to have been correct 
in asserting that speciation and extinction are under 
different rate controls. In the study by Halas et al. 
(2005), extinction rates correlate well with area size, 
but speciation rates do not. How can this happen? 


Dispersal 
into Africa 


Dispersal 
into Asia 


Figure 11. 


Dispersal 
into Asia 


within area. Alternation of biotic expansion/within-area and vicariance (isolation) nodes is characteristic 


According to Cracraft (1985), speciation is largely 
controlled by geography and extinction by environ- 
mental harshness, associated with adaptation and 
niche diversification. That is, extinction is a failure to 
adapt, not a failure to speciate (Brooks & McLennan, 
2002). Occasional episodes of niche diversification 
plus episodic changes in environmental harshness are 
the primary source of the “pulses” in the taxon pulse 
hypothesis. The study by Halas et al. (2005) suggests 


Dispersal 
into Asia 


Node 18 - Vicariance 
Out of Africa 3 


Node 17 - Biotic expansion 
Out of Africa 2 


Node 16 - Vicariance split 
Node 11 - Biotic expansion 
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PACT analysis of elephants, hyenas, and hominoids since the Miocene (Brooks & Folinsbee, 2005; Folinshee 


& Brooks, 2007). Pictorial representation of the taxon pulse scenario that best explains the data. 
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that Losos and Schluter (2000) were not correct in one 
respect: in situ speciation and colonization are not 
necessarily causally linked, such that high rates of 
one need not cause low rates in the other. If speciation 
and extinction are under different rate controls, the 
appearance of fit to their simple deterministic model 
was likely the result of using only a single clade for 
their analysis. 

The combination of speciation and niche diversi- 
fication makes the equilibrium number of species a 
moving target in evolutionary time; that is, evolution is 
a non-equilibrium phenomenon (Brooks & Wiley, 
1988). This complements recent calls by, e.g., Heaney 
(2000) and Whittaker (2000) for modifications of the 
ETIB to incorporate complex patterns of immigration, 
extinction, and diversification occurring on various 
spatial scales and on both ecological and evolutionary 


lime seales. 


CORKS OLUTION 


Fahrenholz (1913) focused entirely on the hosts 
when he postulated that the occurrence of related 
blood-sucking lice on different primates demonstrated 
that the catarrhines were more closely related to 
hominoids than to any other primates. By the late 
1930s, this orthogenetic perspective had produced an 
integrated view of coevolution: parasites are highly 
host specific, so they coevolve with their hosts, and 
because they coevolve with their hosts, they become 
highly host specific. Because host specificity was the 
cause of coevolution rather than a function of the 
ecological interaction between lineages, any conflict- 
ing or inconsistent observations were considered to be 
erroneous or irrelevant because they failed to conform 
to the orthogenetic view of coevolution. Vestiges of 
orthogenetic thinking persist today; this 1s especially 
true for the assumption that hosts and parasites 
“ought” to have congruent phylogenies, embodied in 
the maximum cospeciation school of research (Page, 
2002). 

In contrast with the orthogenetic view of coevolu- 
tion, Kellogg (1896, 1913) cast his observations about 
host-parasite relationships in a Darwinian framework. 
His studies of birds and their biting lice suggested 
that while some host-parasite systems might show 
strong phylogenetic associations, there were substan- 
lial cases of what he termed “straggling” or “host- 
switching.” This perspective was adopted primarily by 
researchers interested in studying the interactions 
(e.g., 


Because 


insecis 


1924). 


between plants and phytophagous 
Verschaffelt, 1910; Brues, 1920, 

those associations often showed no clear phylogenetic 
component with respect to host species (though they 
often were extremely specific), researchers in this 


tradition focused on discovering the ecological ties 
between organisms, particularly the cues insects used 
to locate their host plants. The modern version of this 
second perspective on coevolution emerged in the 
1960s. Following a mathematical model proposed by 
Mode (1958), Ehrlich and Raven (1964) hypothesized 
that the evolutionary diversification of plants and 
insects had been fueled by complex coevolutionary 
interactions involving mutual modification. As sug- 
gested initially by Kellogg, such coevolutionary 
dynamics might have a general phylogenetic context, 
but need not parallel the evolutionary history of the 
specific taxa involved. The distribution of insects 
among plants followed the evolution of host resources 
and the evolution of insects’ abilities to utilize those 
resources, rather than the evolution of host species 
themselves. 

Janzen (1968, 1973a, b, 1980, 1981, 1983, 1985a, 
h) argued that the appearance of tight coevolutionary 
associations at any single locality could be mislead- 
ing. No matter where a given species evolved in the 
first place, its inherited functional abilities may allow 
il to survive in a variety of places under a variety of 
conditions through arbitrary amounts of time. In other 
words, species and their phylogenetically conservative 
traits may disperse through time and space. He called 
this interaction between the past history of the species 
and their present-day associations “ecological fitting” 
(Janzen, 1985b). 

All perspectives that 


disagree 


discussed above agree 
parasites are resource specialists. They 
fundamentally on the question of host switching. 
Can you be a resource specialist and still host switch 
readily? Brooks and McLennan (2002) discussed a 
number of ways in which this could happen. 
Beginning with the parasite, a species might be a 
resource specialist but also might share that specialist 
Irait with one or more close relatives. That is, 
specialization on a particular resource can be a 
plesiomorphic characteristic of a clade of parasites. 
As a result, a given host species occurring in more 
than one area might be inhabited by two different 
species of parasites, related to each other but not 
necessarily as sister species. Such persistent plesio- 
morphic traits also might be co-opted to perform novel 
functions. Trouvé et al. (1998), for example, reported 
that life history traits of parasitic flatworms do not 
differ from life history traits of their free-living 
relatives, indicating that these species do not have a 
“parasitic mode of life” but rather a “platyhelminth 
mode of life” that has been co-opted to function in the 
conlext of parasilizing vertebrates. Persistent ances- 
tral traits also might be “anachronisms” (Janzen & 
Martin, 1982), traits that evolved in a coevolutionary 


context that no longer exists. For example, a number 
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of the Neodermata (the clade of 


platyhelminths including trematodes, monogeneans, 


of members 


tapeworms, and their relatives) have guts, even though 
they also have a highly absorptive integument (the 
neodermis). In the case of the tapeworms, having a 
neodermis facilitated survival even when the gut was 
lost. The persistent intestine in other neodermatans is an 
anachronism, like the vermiform appendix in humans. 

Alternatively and from the hosts’ perspective, the 
resources themselves might be very specific and yet 
still taxonomically and geographically widespread. In 
such cases, a given parasite might inhabit more than 
one species of host, the hosts not being sister species, 
as a result of host switching. Two major categories of 
parasite specialization are preferred site of infection 
(Adamson & Caira, 1994; Brooks & McLennan, 1993), 
as well as their transmission dynamics (Brooks & 
McLennan, 1993; Hoberg & Adams, 2000; Hoberg et 
al., 2000).° Phylogenetic conservatism in parasite 
biology, coupled with phylogenetic conservatism in 
host biology, would create a very large arena for host 
switching, even without the evolution of novel 
capabilities for host utilization. 

If historical conservatism in the type of specializa- 
tion, rather than specialization per se, determines the 
ease or difficulty of host switching, the extent of such 
historical conservatism should dictate our expecta- 
tions about the frequency of occurrence of host 
switching. Two ubiquitous findings from the historical 
ecology revolution of the past 20 years are that all 
aspects of evolution, including ecology and behavior, 
are phylogenetically conservative, and that host switch- 
ing is a regular feature of coevolutionary history (Brooks 
& McLennan, 1991, 1993, 2002). Moving phylogenetic 
studies of coevolution into the age of complexity thus 
requires the use of methods of analysis based on the 
recognition that host switching is common, not rare, and 
that the fundamental job of the investigator is to explain 
the switches that have occurred and, where possible, to 
anticipate switches that could occur rapidly during 
periods of substantial climate change. 


A new guiding principle: The oscillation hypothesis. 
In comparative studies of coevolution, host shifts, 
including increasing host range and speciation by host 
switching, are the analog of dispersal in historical 
biogeography. Complexity studies of coevolution 
should, thus, be based on an analog of the taxon 
pulse hypothesis. Phylogenesis in taxa exhibiting 
“close and evident” ecological associations with host 
taxa should be characterized by episodes of increasing 
host range followed by episodes of isolation on 
Nylin (2007) have 
recently proposed such a model, which they call the 


(Janz et al., 2006; Nylin & 


particular hosts. Janz and 


“oscillation hypothesis” 
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Janz, 2007), and provided empirical evidence 


supporting it based on studies of various lepidop- 
teran taxa and their host plants. 


A new research program for coevolution studies. 
Within the realm of an ontologically complex research 
program, comparative studies of host-parasite associ- 
alions use the following two steps: 

Step 1—Producing a general host cladogram. 
Convert all parasite phylogenetic trees into parasite- 
host cladograms. This is accomplished by replacing 
the names of the parasite species with the hosts they 
inhabit. Then combine them using PACT—this 
produces a general host cladogram (GHC), a map of 
the host context of parasite speciation. We illustrate 
this approach using the pinworms (Enterobius Leach 
[1853]) and hookworms (Oesophagostomum [Conoweb- 
eria] Ihle [1922]) inhabiting the great apes including 
humans (for details with references, see Brooks & 
McLennan, 2003: Brooks & Ferrao. 2005). The GHC 
is shown in Figure 12. 

Step 2—Assessing the host switches. The heavy 
branches on the host cladogram (Fig. 12) indicate 
instances of congruence between host and parasite 
phylogeny. This includes one case (Enterobius gregorii 
Hugot [1983] and EF. vermicularis |Linnaeus, 1758] 
Leach [1853], both inhabiting humans) in which a 
parasite clade speciated but the host did not, and 
another (Oesophagostomum stephanostomum Stossich, 
1904) in which the host clade speciated but the 
parasite did not. Despite the substantial amount of 
which 
happens to be congruent with the host phylogeny, 
the PACT analysis suggests that about 30% of the 
observed host associations are due to host switching. 


cospeciation between the parasites, also 


Only one represents a switch to a non-primate host 
(rodents). The remainder, all of which involve primate 
to primate switches, include acquiring parasites that 
are more or less the same age (three cases), that are 
younger (one case), and that are older (three cases) 
than the hosts. 


Complexity studies in coevolution: Integrating 
biogeography We 
evolution range to occur over 


and coevolution. expect the 
of increased host 
relatively long time periods (Janz et al., 2006; 
Janz & Nylin, 2007; Nylin & Janz, 2007). This has 
led evolutionary biologists to predict that while 
generalists may be at an evolutionary disadvantage 
relative to specialists with respect to particular host 
species in the short term, they are at an advantage in 
the long term. Generalists have this advantage over 
specialists because of their ability to utilize multiple 
hosts, which increases their chances of survival 
climate change and 


during episodes of major 
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General host cladogram produced by PACT, indicating host context of speciation events implied by 


phylogenies for Enterobius Leach (1853) and Oesophagostomum (Conoweberia) thle (1922). Heavy lines indicate associations 
congruent with host phylogeny; thin lines indicate host-switching events. 


associated changes in species composition and trophic 
structure in affected ecosystems. Brooks and Mce- 
Lennan (2002) suggested that ecological fitting, in the 
form of phylogenetically conservative host utilization 
‘apabilities, produces a more complex situation. In 
addition to true generalists, actively using different host 
species representing different resources, and true 
specialists, capable of surviving in association with 
only a single species of hosts, there are what Brooks and 
McLennan (2002) called “faux generalists” and “faux 
specialists.” Faux generalists are resource specialists 
whose resource is widespread among host species 
(synapomorphic, symplesiomorphic, or homoplasious). 
At any given place and time, only a restricted subset 
(often only a single species) of all potential host species 
are available to the specialist, but over a large 
geographic range, the specialist may be associated 
with many hosts. By having a large class of potential 
hosts, this class of specialists may have the same long- 
term evolutionary advantages as true generalists without 
giving up their short-term advantages as resource 
specialists. Faux specialists, by contrast, are resource 
generalists who, at any given place and time, are 
excluded from some suitable hosts by specialists on 
those hosts. By participating in associations with a small 
number of hosts, such generalists may have many of the 
same short-term benefits as true specialists, without 
losing the long-term advantages of being a resource gen- 


eralist. 


The most important aspect of this recognition by 
Brooks and McLennan (2002) is that true generalists, 
faux generalists, and faux specialists can all switch 
hosts rapidly during episodes of climate change. This 
has significant implications for public, livestock, and 
wildlife health specialists (Brooks & Hoberg, 2006, 
2007a, b). One critical implication of the maximum 
cospeciation perspective is the expectation that 
progressive coevolutionary specialization mitigates 
host switches. This means that coevolution should 
provide its own safeguards against emerging infectious 
diseases (EIDs). We would thus expect EIDs to be 
rare events evolutionarily, and the current crisis 
would be seen as something different in kind rather 
than in degree from the evolutionary background of 
coevolution. Under the Darwinian view, by contrast, 
EIDs are not rare events but are common evolutionary 
accidents waiting to happen. In that case, the current 
EID crisis is different only in degree, not in kind, from 
the 


present. The potential for host switching is therefore 


coevolutionary dynamics occurring prior to 
proportional to the number of susceptible hosts that 
are not “apparent” to a parasite al any given time. The 
apparency arena, although phylogenetically conserva- 
tive, is not static. Environmental changes in the place 
of origin for any parasite, leading to altered trophic 
interactions or to geographic dispersal, increase the 
chances that the parasite will come into contact with 
additional susceptible hosts. Each time this happens, 
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Area cladogram produced by PACT, indicating geographic context of speciation events implied by phylogenies 


for Enterobius and Oeoshagostomum (Conoweberia). Heavy lines indicate area associations shown by both parasite clades; thin 


lines indicate area associations shown by only one clade. Areas at nodes indicate episodes of isolation (in either “Asia” or 
“Africa”) or of biotic expansion (“Africa to Asia,” “Asia to Africa,” or “Asia and Africa”). 


the parasite becomes an EID of the newly acquired 
host. 

Brooks (1985) first called attention to analytical 
similarities in phylogenetic studies of coevolution and 
biogeography. Given this, it is fairly straightforward to 
integrate information from both types of studies to 


obtain some understanding of the evolutionary basis of 


EIDs. When the background is one of alternating 
episodes of vicariance and biotic expansion, therefore, 
episodes of rapid host switching associated with the 
biotic expansion events may result. The biogeographic 
analysis of Enterobius and Oesophagostomum (Con- 
oweberia) (Fig. 13) suggests a history of speciation 
involving alternating episodes of isolation in, and then 
movements between, Africa and Asia. Furthermore, 
five of the eight (63%) host switches discovered in 
Figure 12 occurred during periods of biotic expansion 
discovered in Figure 13. The exceptions are the 
switch from primates to rodents in the common 
ancestor of the parasites O. xeri Ortlepp (1922) and 
O. susannae Leroux (1940), which took place in 
Africa, Travassos & 
Vogelsang (1932) in orangutans and OQ. raillieti 
Travassos & Vogelsang (1932) in gibbons (perhaps 
representing a case of sympatric speciation), which 


as well as O. blanchardi 


took place in Asia. Together, these observations 
suggest that these host switches, all of which would 
have been called EIDs at the time of their origins, 
with the possible exception of the switch from 
primates to rodent by the common ancestor of O. xert 
and QO. susannae, resulted from changes in the 


apparency arena rather than the evolution of novel 
capabilities for host-utilization (Brooks & Hoberg, 
2006, 2007a, b). 

EIDs may also arise from host switches mediated by 
ecological fitting within the area of origin when 
climate changes alter trophic structure. For example, 
when ancestral humans moved out of the African 
forest and onto the savannah during the late Pliocene 
and early Pleistocene, they made a rapid transition 
from herbivory to facultative carnivory to active 
predation. During that time, humans apparently 
shared more than just food with other apex carnivores, 
becoming hosts to species of cestodes in the genus 
Taenia L. (1758), whose closest relatives inhabit 
hyenas, large cats, and African hunting dogs (Hoberg 
et al., 2000, 2001). Thus, colonization of hominids was 
driven by shared trophic structure in foraging guilds 
where apex carnivores and ancestral humans foraged 
on an array of antelope prey that served as inter- 
mediate hosts for species of Taenia. 

Until PACT has 


biogeographic and coevolutionary associations among 


now, been used to assess 
species representing multiple clades. Here, for the 
first time, we demonstrate how PACT can be used to 
help further studies of species, speciation, and 
phylogenetics. Because these are new proposals for 
applications of Assumption O analysis, we begin with 
the discussion of basic principles. 

According to Darwin, the origin of species is 
historically contingent in space and time (Darwin, 
1872). Darwinian thought has always considered 
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species to be complex systems, whether we consider 
them to be communities of descent (Darwin, 1872), 
historical lineages (Simpson, 1944; Wiley, 1978, 1981), 
1966), or historically 
cohesive information systems irreversibly split from 
other such units (Brooks & Wiley, 1988; Kornet, 1993; 
Maynard Smith & Szathmary, 1995) through a variety of 


cladogenetic units (Hennig, 


cohesion-breaking mechanisms (Brooks & McLennan, 
2002). Evolutionary phenomena affecting and produc- 
ing species operate at the level of reproductive and 
ecological interactions among individual members of 


various populations that comprise them. 


TOKOGENETICS AND PHYLOGEOGRAPHY 


One of the most fundamental assertions made by 
Hennig (1966) was the distinction between “tokog- 
eny” and “phylogeny.” He asserted that tokogenetic, 
or within-species, relationships could not be analyzed 
using phylogenetic systematic principles due to 
reticulated breeding relationships among organisms, 
but that phylogenetic, or among-species, relationships 
could be analyzed using his method because there 
were no reticulated relationships in phylogeny. By 
implication, clades that have experienced episodes of 
hybrid speciation are excluded from Hennigian 
analysis. The majority of biologists have accepted 
this self-imposed limitation until recently (see, e.g., 


Posada & Crandall, 2001). 


A “new” guiding principle. In discussing empirical 
criteria for demarcation among species, Sites and 
Marshall (2003, 2004) followed Hennig (1966) and 
Kornet (1993) in suggesting that populations showing 
reticulated relationships should be considered a 
single species, whereas populations not showing 
reticulated relationships should be considered more 
than one species. Similarly, Riddle and Hafner (2004) 
suggested that area reticulations should be more 
common when dealing with populations of the same 
species than 


when dealing with populations of 


different species. The developments of Assumption 0 
PACT that 


systematic reasoning can be applied to complex 


and of demonstrate phylogenetic 
historical systems whose members have reticulated 
relationships. Therefore, viewed in the light of 
complexity studies, phylogeography (Avise, 2000) is 
the study of the history of tokogenetic relationships. 


A new research program for tokogenetic analyses. 
Within the realm of an ontologically complex research 
program, phylogeographic analyses of entities (popu- 
lations) that show tokogenetic relationships use the 
following two steps: 


1. PACT analysis of all available data. Convert 
all gene trees into taxon-area cladograms (Fig. 14). 
This is accomplished by replacing the names of the 
haplotypes distinguished by each gene tree with the 
areas they inhabit. Ideally, both mitochondrial and 
nuclear genes should be used. the former because 
they retain patterns of introgression well, and the 
latter because they provide direct evidence of gene 
flow. Then combine the taxon-area cladograms using 
PACT. This produces a GAC (Fig. 15) depicting the 
geographic context of the deployment of genetic 
information within and among the geographically 
defined populations. 

2. Assessing species boundaries and patterns 
of genetic cohesion. As suggested by Sites and 
Marshall (2003, 2004), 


between tokogeny and phylogeny provides a straight- 


the Hennigian distinction 


forward empirical criterion for determining how many 
different species are represented im a sample of 
populations. Species (macro-species of Kornet. 1993) 
are identified by the largest inclusive subset (clade) of 
populations (micro-species of Kornet, 1993) that show 
reticulated relationships. Using this formalism, Fig- 
ure 15 depicts three species, comprising populations 
a-d, populations e-h, and populations i-l. respec- 
tively. 

The next step is to ascertain the patterns of genetic 
cohesion among the populations within each species. 
A truly panmictic species would be characterized by 
all populations showing reticulated relationships with 
each other, producing a GAC in the form of a single 
unresolved polytomy for all populations, In a 
Darwinian world, historical contingencies of heredity 
(the nature of the organism) and circumstances (the 
nature of the conditions) should produce various 
degrees of within-species cohesion. The hypothetical 


three-species system in Figure 15 


illustrates this 
point, Each species is characterized by four popula- 
tions. Species | (populations a-d) has one unique 
population (d) and three populations exhibiting a total 
of four reticulations (one each for populations a and c, 
two for population b), combining for eight inter- 
population connections. Species 2 (populations e—h) 
has two unique populations (f and h) and two 
populations exhibiting a total of two duplications 


(one each for e and g), 


combining for 13 inter- 
population connections. Finally, species 3 (popula- 
tions i-l) has one unique population (j) and three 
populations exhibiting a total of four duplications (one 
each for k and l, two for i), combining for 14 inter- 
population connections. If the magnitude of gene flow 
is comparable in all inter-population connections, 
then species 2 and 3 would be deemed more cohesive 
than species l, and species 2 might be considered to 
have achieved its high degree of cohesion more 
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Figure 14. PACT and phylogeography 1. Three gene trees for various populations. 


efficiently than species 3, needing only two duplica- 
tions to achieve 13 inter-population connections as 
opposed to four duplications to achieve 14 inter- 
population connections. 


PHYLOGENETICS 


The Hennigian assumption that phylogeny can be 
represented accurately by an internested branching 
diagram is refuted each time a new species is formed 
by the hybridization of two (or more) other species. 
This produces a reticulated phylogenetic history that 
cannot be represented by a purely internested 


branching diagram that is the result of analyses with 
standard phylogenetic methods. 

Most zoologists consider hybrid speciation to be 
rare in animals, but botanists have recognized for a 
long time that many plant species have been formed in 
this manner (Vriesendorp & Bakker, 2005 and 
examples therein). Furthermore, the endosymbiont 
theory of the origin of eukaryotes (e.g., Schwartz & 
Dayhoff, 1978; Margulis, 1981; Sitte, 1993) postulates 
that certain major transitions in evolution (sensu 
Maynard Smith & Szathmary, 1995) are the result of 
hybridization. At least some prokaryotes have an 
interesting ability that refers to the capability of 
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Figure 15, 


producing reticulated phylogenetic relationships. In 
times of stress, these prokaryotes not only shut down 
their metabolic activities but may actually jettison the 
portion of their genomes associated with metabolic 
activities. Furthermore, they are capable of regaining 
genetic machinery for metabolic activities, but not 
always from a close relative. 

Partial hybridization is a common process for 
prokaryotes (Woese, 2000, 2002). In bacteria, genes 
for resistance to antibiotics are usually situated on 
plasmids. These plasmids behave as accessory 
bacterial chromosomes and can be transferred by 
bacterial conjugation. Their plasmids can be ex- 
changed via a tubular connection between two 
bacterial cells, and resistance to antibiotics can 
become widespread to relative as well as nonrelative 
species. Another way of transferring genetic material 
between two species of bacteria can be accomplished 
by transduction by a bacterial virus (bacteriophages or 
phage). This partial hybridization by exchange of 
genetic material, however, is not only restricted to 
bacteria. In case of bacterial transformation (by, e.g., 
Agrobacterium tumefaciens [Smith & Townsend, 1907] 
Conn [1942] or A. rhizogenes [Riker et al., 1930] Conn 
[1942]), genetic material may even be transferred from 
a prokaryote to a higher plant species. 

A “new” guiding principle. Phylogeneticists were 
aware early on that species of hybrid origin might pose 
special problems (Bremer & Wanntorp, 1979; Funk, 
1981), and this was the subject of a symposium at the 
second meeting of the Wili Hennig Society 
(Humphries, 1983; Nelson, 1983; Wagner, 1983; 
Wanntorp, 1983). Funk (1985), however, was the first 
to propose that analyzing correlated homoplasies in 
analyses of clades containing species of hybrid origin 


K K L 


PACT and phylogeography 2. PACT analysis of three gene trees shown in Figure 14. 


could be used to detect the hybrid species. She noted 
that when there are multiple equally parsimonious trees 
for a clade suspected of containing species of hybrid 
origins, those trees often represent a small number of 
distinct topologies. Furthermore, inspection of the trees 
indicated that most of the homoplasy and, thus, most of 
the ambiguity was contributed by a small number of 
species. Removal of those species from the analysis 
produced a stable topology and reduced homoplasy. 
Funk’s insights, combined with PACT, form the basis 
for an approach to phylogenetic analysis under 
Assumption 0. 


A new research program for phylogenetic analy- 
ses. Within the realm of an ontologically complex 
research program, phylogenetic analyses of entities 
show cladogenetic as well as reticulated relationships, 
and the following two steps are applied. 

Step I—PACT analysis of all available data. 
Convert all transformation series into character-state 
trees, then combine them using PACT (Figs. 16, 17), 
ignoring the plesiomorphic state. This produces a 
phylogenetic tree in which all traits interpreted as 
homoplasy by standard Hennigian analysis will be 
interpreted as homologies resulting from horizontal 
transfer/hybridization. 

Step 2—Explaining reticulations. There are two 
different, yet intertwined, issues involved in this step. 
The first is differentiating reticulated homology from 
true homoplasy. The second is differentiating hori- 
zontal transfer from hybrid speciation. 

The key to deciding the first issue is having 
phylogenies based on as many characters and 


character types as possible—comprehensive total- 
evidence studies. We would then expect, as suggested 


by Funk (1985), that cases of hybrid speciation would 
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17 


—16. PACT and analysis of hybrid species. Top four are binary character-state trees indicating 


distribution of apomorphic states. Bottom three are PACT results of (from left) combining | and 2, then (1 + 2) and 3. then (1 + 
2 +3) and 4. —17. PACT and analysis of hybrid species. Top 2 diagrams are gene trees; bottom diagram is PACT result for 


combining the gene trees. Parental species are sister species. 


appear in a PACT analysis as duplications supported 
by many characters (because hybrid speciation 
involves combinations of whole genomes), whereas 
cases of true homoplasy would appear as duplications 
supported by single characters (because homoplasy 
involves single characters). For such cases, relying on 
epistemological parsimony would have an ontological 
justification. Consider this thought experiment: 

One species (Z) forms a sister relationship with 
each of two other species (A and B) in a PACT 
analysis. If you have one case of (BZ) and one of (AZ), 
there are three possible explanations: (1) hybrid 
speciation (Assumption 0 default); (2) one case of 
horizontal transfer (A is the true sister species of Z, 
with horizontal transfer to B or vice versa); or (3) one 
case of true homoplasy (independent origin). Using 
epistemological parsimony alone, there is no way to 
choose, based on the evidence al hand. 


Next, suppose you have five cases of (BZ) and five 
cases of (AZ). Now the explanations become (1) one 
case of hybrid speciation; (2) five cases of horizontal 
transfer or one to four cases of horizontal transfer of 
linked traits; and (3) five cases of independent 
evolution or one to four cases of “co-adapted trait 
complex” or “adaptive syndrome” evolution. In this 
case, epistemological parsimony would lead us to 
choose hybrid speciation unambiguously as the 
preferred explanation. That means we hypothesize 
that all shared similarities among the species are 
homologous. 

This approach is not guaranteed 10 produce correct 
results, of course. It works when enough of the 
heritage of each parental species is expressed in the 
hybrid product(s). We expect that the more distantly 
related the parents are, the less likely we will be able 
to recover the parental species, but also the less likely 
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that hybrid speciation will occur. In addition, we often 
expect hybrid species to segregate phylogenetically 
more with one parent than another (see McDade, 
1990, 1992; Mindell, 1992, 1993; Zrzavy & Skala, 
1993; Skala & Zrzavy, 1994). For cases in which 
ambiguity in phylogenetic analysis results from a 
small number of species and when there is abundant 
evidence of correlated homoplasy, Funk’s approach 
gives us an excellent first approximation as to the 
possibility that the ambiguity is due to those species 
being hybrids, justifying the use of Assumption 0 
analysis. However, when larger numbers of species 
are involved, the explanations for character ambiguity 
may become very complex. 

For example, recent studies of prokaryote phyloge- 
netics have raised the possibility that this propensity 
on the part of prokaryotes has produced some wildly 
reticulated phylogenetic relationships. Some have 
even expressed the opinion that this means we should 
change our view of phylogeny, from a tree to a 
reticulated network (Doolittle, 2000). Maynard Smith 
and Szathmary (1995), however, pointed out that 
genetic machinery associated with storing and 
transmitting information shows no similar evidence 
of horizontal transfer. Thus, the true phylogeny may 
still be tree-like in pattern, although substantial 
homoplasy in metabolic genes may be present due 
to horizontal transfer. 

Regardless of the eventual outcome of this debate, 
we do not have to develop fundamentally different 
approaches to assessing the evidence about relation- 
ships. It simply means that the more horizontal 
transfer there has been, the more we will have to 
resort to species duplications to find the source of 
those transfers. Much more work, including substan- 
lial modeling effort, is needed to help determine when 
it is appropriate to use Assumption 0 in phylogenetic 
analysis and when it is nol. 

We will 


Assumption O and PACT are applied in this context, 


now address the second issue. When 


all reticulations are taken as evidence of introgression, 


but introgression can produce multiple outcomes, of 


which speciation is only one. Differentiating the 
various forms and outcomes of introgression requires 
a robust and unified ontological species concept (the 
evolutionary species concept; Simpson, 1944, 1951; 
Wiley, 1978) as well as a broad-based epistemological 
view of the diversity of kinds of species. As we 
suggested earlier, the PACT analysis is the beginning 
of an explanation, not an explanation in itself. 
Inference of species boundaries requires additional 
tests (such as nested clade analysis) to highlight the 
independent lineages and experiments to determine the 
strength of cohesion of those lineages (for a discussion 
and protocol, see Brooks & McLennan, 2002). 
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Finally, we come to the issue of how to incorporate 
reticulated phylogenetic events into our classifica- 
tions. There are two choices. Do we classify a tree with 
reuculations, or do we try to classify a reticulated 
network? If we choose the former, hybrid species will 
appear in two different clades, reflecting their 
phylogenetic origins. This disturbs traditional classi- 
fications in which each taxon has a unique place. If 
we choose to try to classify a network, each species of 
hybrid origin would be classified separately from 
either parental species, so the resulting classification 
would maintain the tradition of each taxon having a 
unique placement, but could not provide information 
about the phylogenetic relationships of the hybrid 
species. If we wish our classifications to reflect what 
we think we know about evolution, it seems that we 


will have to opt for the first alternative. 


CONCLUSIONS: BACK TO THE FUTURE 


Evolution is historically contingent and complex. 
Darwinism provides an ontology of complexity. Histor- 
ical contingency requires phylogenetic information to 
document and explain patterns. Representing this 
complexity so that we have some understanding of what 
needs to be explained requires a method that allows 
reticulated relationships to be found. A new algorithm, 
PACT (Wojeicki & Brooks, 2004, 2005), has been 
developed for that purpose. Applying this algorithm in 
historical biogeographical, coevolutionary, phylogeo- 
graphical, and phylogenetic studies invokes Assump- 
tion Q: all data must be analyzed without modification. 
Thereby, the resulting patterns of related areas, species, 
and populations contain duplications of entities to such 
an extent that these patterns are logically consistent 
with all input data. Further inference of these 
duplications asks for an ontology of complexity in 
which multiple evolutionary processes resulting in both 
divergent and reticulated patierns nest comfortably. 

Methods for phylogenetic analysis that only deal 
with divergent patterns and do not allow duplication of 
entities disallow complex evolutionary processes for 
explaining the convergent (reticulated) patterns and 
thereby oversimplify the historical relationships 
among the entities of analysis. Furthermore, several 
studies mentioned in this paper show that several 
older and newer concepts like the taxon pulse (Erwin, 
1979), ecological fitting Janzen, 1985b), phylogenetic 
analysis of hybrids (Funk, 1985), and the oscillation 
hypothesis (Janz & Nylin, 2007) allow for explanation 
of reticulated patterns. 

In order to perform comparative analyses based 
upon an ontology of complexity, we do not have to 
begin again because Darwinism is based on such an 
ontology. We do, however, have to move forward, 
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beyond the oversimplification of evolution associated 
with much of 20th-century evolutionary biology. Basic 
principles of phylogenetic systematics, coupled with 
Assumption 0, can accommodate analyses of complex 
evolutionary associations al various levels of organi- 
zation. Once we begin doing this, we will recapture 
earlier ideas and integrate them into a renewed 
appreciation for the Darwinian panorama. 
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